A First Investigation on Mongolian Information Retrieval
نویسندگان
چکیده
In this paper we present an attempt to build a test collection for Mongolian IR as well as some preliminary tests about the key issues in Mongolian Information Retrieval: using a stoplist and using word stemming. Our preliminary tests will show that while these basic operations on Mongolian can bring slight improvements in retrieval effectiveness, many problems remain. The results using stemming and stoplist show that the stemming can potentially lead to some gain in retrieval effectiveness; The stoplist slightly improve retrieval effectiveness, but it can reduce the index significantly.
منابع مشابه
Design and Realization of Mongolian Syntactic Retrieval System Based on Dependency Treebank
In the past seven years, Language Research Institute of Inner Mongolia University has constructed a 500,000word scale Mongolian dependency treebank. The syntactic treebank provides a favorable data platform for language research and information processing. In order to effectively use the treebank, we have designed and implemented a graphical syntactic information retrieval system based on the M...
متن کاملThe Study on Key Technology of Mongolian Full-Text Retrieval
With the development of the Mongolian corpus and website, an increasing number of people have focused their attention on the accurate, complete and fast retrieval of the information that they need. In this paper, such key technological issues in Mongolian full-text retrieval as character shape indexing, drawing the Mongolian verb stem and the automatic recognition of the Mongolian homographic w...
متن کاملResearch on Reasoning and Retrieval Methods Based on Mongolian Curriculum Areas of Semantic Web
The backwardness of the Mongolian network teaching resources results in its low reuse rates and utilization. For this situation, a retrieval method of semantic web based on Mongolian curriculum areas was set up. Firstly, the method established the Mongolian ontology of course ‘Artificial Intelligence ( )’in area of teaching, it uses a relationship database MySQL to record ontology information, ...
متن کاملA Lemmatization Method for Modern Mongolian and its Application to Information Retrieval
In Modern Mongolian, a content word can be inflected when concatenated with suffixes. Identifying the original forms of content words is crucial for natural language processing and information retrieval. We propose a lemmatization method for Modern Mongolian and apply our method to indexing for information retrieval. We use technical abstracts to show the effectiveness of our method experimenta...
متن کاملThe Semantic Annotation Based on Mongolian Place Recognition
The Mongolian semantic description is the chief problem in construction of Mongolian semantic web. Particularly, more and more Mongolian websites have been created in recent years, and the number of users is sharply increasing, both of them demand for higher quality of Mongolian information retrieval. This paper is based on the study of Mongolian place name recognition and semantic annotation. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008